Corpus: ceb_wikipedia_2011_10K

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 2373 p-
2 2087 m-
3 1526 n-
4 1356 k-
5 1174 g-
Top Character Bigrams
word rank frequency n-gram
1 1675 pa-
2 1356 na-
3 1136 ma-
4 935 gi-
5 812 ka-
Top Character Trigrams
word rank frequency n-gram
1 1046 pag-
2 515 nag-
3 276 pan-
4 242 mak-
5 233 nak-
Top Character 4-Grams
word rank frequency n-gram
1 241 pagk-
2 221 pagp-
3 205 maka-
4 196 naka-
5 168 gipa-
Top Character 5-Grams
word rank frequency n-gram
1 215 pagka-
2 192 pagpa-
3 74 nagpa-
4 62 nagka-
5 50 pag-a-
388 msec needed at 2017-12-02 13:37